Topic identification challenge
نویسندگان
چکیده
منابع مشابه
Topic Identification
Unter Topic-Identifikation versteht man die Generierung sinnvoller und ausdrucksstarker Kurzbeschreibungen bzw. Label für Gruppen von Dokumenten. Topic-Identifikation spielt eine Schlüsselrolle in allen Anwendungen, in denen unüberwacht Kategorien, also Gruppen von Dokumenten gebildet werden: Eine automatisch erstellte Dokumentkategorisierung ist wertlos, wenn es nicht gelingt, Kategoriebezeich...
متن کاملTopic Identification
In this chapter we discuss the problem of identifying the underlying topics beings discussed in spoken audio recordings. We focus primarily on the issues related to supervised topic classification or detection tasks using labeled training data, but we also discuss approaches for other related tasks including novel topic detection and unsupervised topic clustering. The chapter provides an overvi...
متن کاملTopic Identification in Discourse
This paper proposes a corpus-based language model for topic identification. We analyze the association of noun-noun and noun-verb pairs in LOB Corpus. The word association norms are based on three factors: 1) word importance, 2) pair co-occurrence, and 3) distance. They are trained on the paragraph and sentence levels for noun-noun and nounverb pairs, respectively. Under the topic coherence pos...
متن کاملKnowledge-Based Automatic Topic Identification
As the first step in an automated text summarization algorithm, this work presents a new method for automatically identifying the central ideas in a text based on a knowledge-based concept counting paradigm. To represent and generalize concepts, we use the hierarchical concept taxonomy WordNet. By setting appropriate cutoff values for such parameters as concept generality and child-to-parent fr...
متن کاملTopic Identification: Framework and Application
This paper is on topic identification, i. e., the construction of useful labels for sets of documents. Topic identification is essential in connection within categorizing search applications, where several sets of documents are delivered and an expressive description for each category must be constructed on the fly. The contributions of this paper are threefold. (1) It presents a framework to f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Scientometrics
سال: 2017
ISSN: 0138-9130,1588-2861
DOI: 10.1007/s11192-017-2307-0